Corpus: urd_news_2007_30K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word N-grams (N = 1 to 5) at sentence endings among the first K sentences

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              20            25             29            30            31
1000             94           245            309           320           323
10000           358          1677           2763          3162          3228
100000         1347          8526          18847         25591         28134
1000000        1347          8526          18847         25591         28134
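
The page does not say how these counts are computed, so the following is only a minimal sketch of one plausible reading: for each N, count the distinct word sequences formed by the last N tokens of each of the first K sentences. The function name `ending_ngram_counts`, the whitespace tokenization, and the choice to count distinct sequences are all assumptions made for illustration, not the method used to produce the table.

```python
from typing import List, Sequence, Set, Tuple


def ending_ngram_counts(sentences: Sequence[str], k: int, max_n: int = 5) -> List[int]:
    """Count distinct sentence-final word N-grams (N = 1..max_n)
    among the first k sentences.

    Assumption: tokens are separated by whitespace, and a sentence with
    fewer than N tokens contributes no N-gram for that N.
    """
    seen: List[Set[Tuple[str, ...]]] = [set() for _ in range(max_n)]
    for sentence in sentences[:k]:  # slicing is safe even if k > len(sentences)
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                seen[n - 1].add(tuple(tokens[-n:]))
    return [len(s) for s in seen]


if __name__ == "__main__":
    toy = ["a b c", "x b c", "d e"]
    for k in (1, 2, 3, 10):
        print(k, ending_ngram_counts(toy, k))
```

Under this reading, the counts stop growing once K exceeds the number of sentences available, which would be consistent with the identical rows for K = 100000 and K = 1000000 above.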


Zipf's diagram for sentence endings


[Figure: Gnuplot diagram; image not reproduced]
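
A Zipf diagram plots frequency against rank, usually on logarithmic axes. The data behind this figure is not given here, so the sketch below only shows how such rank and frequency pairs could be derived for sentence-final words. The function name `zipf_rank_frequency` and the whitespace tokenization are assumptions for illustration.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def zipf_rank_frequency(sentences: Sequence[str]) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final words,
    ordered from the most frequent word (rank 1) downwards.

    Assumption: the last whitespace-separated token is the final word;
    empty sentences are skipped.
    """
    last_words = [s.split()[-1] for s in sentences if s.split()]
    counts = Counter(last_words)
    frequencies = sorted(counts.values(), reverse=True)
    return list(enumerate(frequencies, start=1))


if __name__ == "__main__":
    toy = ["a b", "c b", "d e", "b", ""]
    for rank, freq in zipf_rank_frequency(toy):
        print(rank, freq)
```

Plotting these pairs with both axes on a logarithmic scale gives the usual form of the diagram; a roughly straight line would indicate Zipf-like behaviour.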

Generated in 1990 ms on 2018-03-30 at 09:17